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1. --^Jk-^^A-^mm^t^^^Jl:^^^^. ^^;3r^fe4t 
10. 1 ^ir>k, ^ffJ[it^^i^f^^—^t^^ 



1 




11. ^^M]-^^ 1 6^^^. ^-f^^^^nL^mn^n^^ 

13. ^mM']^^ 1 ^-ft^^^m^T^»]^^^M 

60 J. ^il 1^^^ ^ 60 

15. ;J|L#*L*JJI-^ 14 60;5r*, * t^Jt^^'^^^'CI.-^'KrT ft^^^Jt 

# A n A-^^^ l«l 60 ^3i4.^>l. 

16. #.#*i^'JJ^ijtl4 60^*, *t^5f.^'^^'Ci.#ibTi«tl-i"i*3 
^60'fr44^tii#^^60^^X;ii^'aL. 

17. #.|&*i^JJ^^14 60;r^, *t^^^^^iiL#ibTJL*>W 
19. 1 60:5r^, ^f^i^^$~^J^^^-^^^ML 

60-4^*^; ^ 

#iit4l;t-^^^^60^/^-Mi#iRj5'J>t#/-; ^ 
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^^^^ • « 



21. ^^M]^^20^-^^^tn^^. ^t^^l^^t^l^^^^ 

23. ;t|L|&*l^'J^^20 6^^^^t^^M., ^ t^^J^^t &^^>r 

24. ;ffL||*t5f>JJ^^ 20 ¥i^m^t ^ t/^^>t^>^t ?1 

26. #.#*cifj^^20 6^^^4^^^.^. *t/9r^^s^it-#se. 

27. #-#*l#'jJ^^20 6^^«;|^^f,i^, ^^^^^^^^-"^^ 

28. ;^#*l^'JJ^^20 6^^j^^t.^^^, ^ t^^^^^^^-fS^ 

29. ;t|L^*lifJJ^^29*^-|-«;i^t^.ife, %^ff\iS.^^^^^%l 

31. #.|&*LifJ^^29fit-|-«^^.?:^, ^t^^^^^WtbT 
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G.Salton ,Automatic Text Processing, Addison-Wesley, 1989. ^ ?1 4. 

-##i:;^:J»^^^^*l^^#EK.6^;^^J. t ?1 ^^l^lt(indexmg 
phase)^— M^^-^^^^^ii^^^. .fii^j^t R(h-^(retrieval phase) 



fJ^-^##:?;^H«.j5'J, ^^ly^sm^olcmt^ 

(tokenization), (iii)^^(morphological)^*r, JS- 

(iv)'6t ^ # 6t ^-^-^^ (stop-word)^'I ^JK^ JK^ . it 

#^i?>i^>^t.jft.'ftj?(r;#-M6^ S.Dharanipragada et al., "Audio-Indexing for 
Broadcast News," in Proc. SDR97, 1997. 



2 




5Lfi^^-Nj^ A(constramts) ?1 6^-fM^^^4gt^>^(^^^)^# 

Lambda ^>^L)H->t^A^##iJ'>fi^^^t-^^ f->e.^T^;l6^^-i:. 
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@ 1 l^^t?l:^^t#-^^ ( unigram) X>H^(^^)fi^-5^ 

^; 

>^E.® 1 f-ir^^i^^^xmii^-A-'^M^tn^^ 100. d^Tft^iat 

500, 

"t^^tn^^^OQ. ■kpT^f^^-fm^^. 500>j5E.t 
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• ••• 

•••• • 



^-m^/^s.^A. j^^^^^^^f. n-^A-'t 

Okapi ^ia.i:^^^A->h^:^^6^>^it. ^it^^t^^ 

^. Lambda A>^t;«t>t#Aii##i?-6t^^t-^^ "t^^^'f^^!^^^ 



s 




100 -l-^^tf^^ 100 T)':^^i«y->hii.^i1-^SM.:*:^^f^ 

110 ^;*S^fi^#'^^, :fc»lt#'J^^iS:-^ 120, '&T>':t^*k.^^^:^>ifc;^ 
J*.. ^tiS^ 110 T)5t'f^i(j->N#-^6^^^^:^^#^5il> ^:tA/L>h;^ 
Afe.il^^^>':^ffe^^l^^6^^S^. ^^^^i^^- 120 >S./^->h>^ 
^##§(ROM)^f'#'^-4^-^-Jt4-. 110 4^;^t, 

#t##^^4|- 120 M.^^^-A-'^mt^^^^ 150, 

iL ^ >h#.*l JSt 93 T t ?I 1 6^ ^ M ^iJl^ i:^ (il ?l 
^^). ^^K M^m^^ 120 200 ^-il 

^>h-;t#>^t?l5t:^ 300, T^^^^® 2 3 ^j5»Ji^^f#tfe-. lit, 

d^T^^^S 2AS1 2D^#t^6^, 200 &fe->hi:;^lt 

210(@ 2A), -&-iL^^^^t^\ 240(S 2B), ->h#-^^:s:# 
(t5)^)260O 2C)>':^;S.->^^#i:;iit?l(IDF)275(@ 2D). 

^m{^^m)3L^^iiL^-^^A^^H^. 300 

300j^#:^1^, 200 1 6^^^^^^^"^^^ 

^■^6^:iS^ES&#JL^>t-#At?l:si:'H^ 300 t>t#>^#^^^^^6^?t# 

i5 ^^^^^^ 120 fe^^^-f^^, :a^>^>R^#*ti£|l 110 
^^T^^^^^ 5it-##t&6^^?l^i^^^ 500 ^^T®#^'^S 6 

^liti:.^^^ sooy^^mt^^^^ 150 t-il^^-t^x^ii^^^^ 

^A;iBi«t>&6^i^^t5l^'H^ 200 ^Jt^A^SIiL'ff 300. 
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^^^^ • •••• s , , , 

•••• •• • •••• 

• ••• 



-l-M^t 600 r l^^t ?I 200 

300, #.#-f«l^^j^';t#A^tR^#^^-^^2^Sii» ^# 

i:^ 300. 

d^-t^^, 200 ^k^-^SL^^i^^ 210(@ 2A), - 

>hi:;|i>f|-^*t?l 240(® 2B), ->h#-4^^i:#(-isi^)260(S IQJii^ 
->h^#i:;^^llaDF)275(S 2D). il^ff 200 >S-lt>6^ 

^fe-ifllt, J^fit?Ii:#200>i§t^'fe^4tie'^T Okapi 

i:;iiit*;^ 210(® 2A)^^Tl^#-»e.^, -j^-l^x-je.^ 211 fj 214, 
#-t&|t4^5^;5f^^;5fe^t->^^^«*&^ 100 >h:N6^i:##^*^^. 

^it^^lal^ 50 )t^-f #>^^^ 220 fft 

^T^i^Si^^^^, ^^M^ 210 .^L^ 222 ^ 224 ^^iS']^^-^^?^^* 

210 Ii#->h4t#, ^^4^n^^^^^ik^^tmsi^ 

^.rn^^^t^] 240(® 2B)i^^T 241 244, 

7?:^m;5-*t, 240 100#-te.f:. ^^250 

255 1 4t ^ T 6^ . 
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J^Si^f¥j^$L±^ ^-^lail'ff 260 ^4^T^#-ie.f-, ^d^i&lt 261 

s«i 264, ^4kT^^n^^^'^^^^-^^nn^^. ^^26s^^^ 

#j#X#t?I 275(@ 2D)i^-iT^^lJ^SL^^^{^Mt^^)^^A- 

n'^kSL. ¥i^SL^t^\ 275 ^^T^^i&^, ^■k^iL^216n 279, # 

■^ii^T#t?x, 275>^d^ 285 t^t^T-^^, ^ 290 

#i:#^i^(IDF), ^ 295 AAia*i^6t[^^r3t#6^i^J^. ^ 295 t^^ 

t. 

>-t^^, S 3 ^^6^>t#>^t $1 300 ^-^^mX.'^^^^ 
;t^^#T->hJ^#A#^, Jt^^^t^l^t'ff 300 ^^T^^T&^, -fifll 
305 JI»J 312, #^t&|t^->h-^^it'fr6^;F^>t^^^. #4^^ 
^>i^I^^lS16^7t#>^;fel^. 325 t#iRT^>^;tlt. >1r#Al^?Ii: 
# 300 .^t^ 330 t#i"-T^>^fi^>t#>-, J^335;I.&^^>^IS:6^^J^6^ 

i5^K J^#A t §1 il-H^ 300 340 ^ 345 t^iS'J 

>f=^ 300 350 t&J^T-^hi^'flL, 5 Jp'Tft^-H-tfe- 

® 4^ti{T->he>^fi^^f-^#^^->e'>^#A.6^*ts^^. ^® 4 

it:^, 'ef>I:fei—>hJ^?tf^>e^i^^] (pulse-code modulated, PCM)^^^'^ 

^j^#>^^i5^3S 410. j^i^A^i&^S 410 xt>t#>^i>i|^i:'ftii#^^ 
;ft, ^>^#A^#>^ 420 t^^^>1^#A.'^J^t'-^^Z'^. #';t#>^6^^ 
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« • • • 
• • • 



^^6^, >i5t.#J|^510f'J 535^#I^^tlI^i^^^, 510 JS. 550 fll 

( cepstral feature) ^^m. 510 vX^^^^^fK^M^^'^^^^'k^, 
iit, 5l0#-i-^i:>fr6^m>&M^J^, ^'h^^^S#i4:4fislr^ 

i^J&-t■^^4'8t^^^--'^#^?l ^. ^J** ViaVioce #^i?>j5»J 
ili, '^^VXJK^^ Armonk fi^ IBM >'>^i'>tWiI^:^^#^J. >^#^ 515 

100>hT^6^i:###Jfet. 

i^^t5lit:# 200^i^6^M.-H-^4T^:t>^^#JSR 530>y.-|-Mi:^t^ 

J^. jfiXfiiT^ati^, (i)#te.>ffc(tokenization), (ii) 

#te.^^^^, (iii)#^(morphoIogical)^*r, A.{vf)^m^^^%^^^n 
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* • • • • 



530, t 500 iiii Okapi ^^#5'J^>fi^fi^i^^#"^. 
530 t^#^'f^4>^#4^ 535 200 

^>ffc, J^-^m. 550 'j!fr^)tt^MII^^-^^6^^-5?-(^#)^#l?.«3. #>^ 
# ^ 6^ # ii: # ^ ('IS. >t A j^"] ^ ^ 1 6^ It 'i ^ ;t -S: t 

(model-selection criterion) A;0t^.^^^iil't*r^4#>l (Bayesian 
Information Criterion, BIC). 

it^M^m^^^n^^. BIC ^#^m*r^^6^(>^#A)^#Bti:T^ 

^^fi^it^H:. >lg.&Ji^i^A-|-^'ft-f--|-ibfi^i!a(10ms)4:l*i:.i^#JL^- 
i*L^^*h3£6^^^. i?yT>^t4gi, l<=i<N, iLJ&>^E.->h N #^1^:1:6^ t 

i^*r^3g^^^4^tp^> ^#4£^-f '&i^^6^:t^^^^^-^^ii(full 
ccvariance) {ji, Z} • f^^A^^^Mf^-^fi^miii, Yi} ^^t'C'^j 

ABIC(i)=-R(i)+XP, 

i?0) = Ylog|Z|-flog|l.l-^log|Z.| 
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|gt8, N2= (N-i) d 6^ ^(dimension). S 

It 

ABIC<0 ^'Sltt, #^a3t'J^^?§>h^*rfi^^^^^ 
>(lL^fcl (thresholded-likelihood) ^t^^i^ ^# ^ ^^'J ^ 

( melcepstra ) tft. 

T*4»iE. BIC fi^if^^lil^^. ibi^B^^6^^^>gS#J^, BIC 

tcr30j^ifr M=N/r >h^>^IS:. >i§t(M-l)<^L BIC ^T^^^^f^ ^^^^ 
;S.ABIC<0 ^^{lL^dt6^->h. ^^>?^>j«£5l#->h$lfcil. t 

^, AN.;r-4^-tl^. §#«!ltniL*LBt, BIC fi^-H-^^fe-ltTJ^^^-lP- 
i^'h, dta^^-^6^^>>i -^^^M„„, M„„ .1/^1 BIC ■H-*^*^#- 
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^fm. 555 t, ^Mfm 550 i^^^^^^^m. 510 

H.S.M.Beigietalo "IBM Model-Based and Fram-By-Fram Speaker- 
Recognition, " in Proc. Of Speaker Recognition and Its Commercial 
and Forensic Applications, Avignon, France(1998). il.^, 

tfi60ii-|->^^-J. M,^^ i >h?t#A.ill^, ^ d 

^^^^^'^T. t*A^'Ac4-»-«*-^^'^*(Mean vector), -^i^rj.^ 
1^, ^-H-#t(Counts)ia;^l. 

ii.it'fit^ H.S.M. Beigi et. Al, " "A Distance Measure Between 
Collections of Distributions and Its Application to Speaker. 
Recognition, " Proc. ICASSP98, Seattle, WA, 1998 ^iii^^t^ 

E S&6^ j:i<fc^^(pass)JSl-*i*.^#), vXJSim 

#A^^(clustering). 
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J^*riS.^:^^(Gaussian Mixture Model, GMlWOd^ n, >h^:^6^)S.^*C 
d.. = -\og Y!j^iP'-^{f-\^^f^^ )] 
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O A«a 3 «« «' 

• 

* r« 44 •••• •« 

(audio clip)6^^i^Btl^6^A-^:fejtBtl«l. ^— >^#^(>j5t^^^.^l«I^^ 

J^fm. 575 til-^4#L#^>^t->^';t#At ?li:# 300 t. 
600 t ?1 ?! 

>^t#rt8t, l^^j^?t#A-|-j!S5^^t^^ 600>^#5^610^20^ 

?1 200 t ?1 X>H^ 300 ;^fm. 620 

JSfL 625 X^mi^^-A-^i^^^n^S^T^. 

^, ^i^JH^ 635 J«t^i<5l^^^ft^t?l ^^^'ff 200 Stmt^^J,^M-^n^ 
^^m^(^^yL^i^iM.)i^^^A^^^y>i^. 645), 
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a,=0.5, 02=0.5). #.7^-^. idf ^SiJL^'^i^^Stf^n^T^^^^^^ 
Okapi y^^^if^l'^fi^'^^^:^^^^' ^i^M^^-^it^M^l^^ 

jbm^^ifcii^tfi^^. 





IS 



• • • • « « » mm , 



4, ^tt-f'&^^'JI^^tiE^-^^fi^St-^il^, itit>&^ Java ^^ital 
^it^^m, v:*^^^!!!^^ VCR 6^^tf»^^ MPEG-1 Java 

(next-best guesses)ii4f r:^- T^X^it— #^4:^^^?! 
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